Exploiting related digital library corpora with query rewriting

نویسندگان

  • Federica Mandreoli
  • Riccardo Martoglia
چکیده

In this paper, we present the preliminary results of the ongoing research activity we are carrying out in the context of approximate XML query answering when the schemas of the XML documents are available. The method we propose involves a preliminary schema matching process, which automatically identifies the semantic and structural similarities between the schema elements to be used in the subsequent operation of query rewriting, in which a query written on a source schema is automatically rewritten in order to be compatible with the other useful XML documents. The proposed approach has been implemented in a web service, named XML SMART, which is part of the open architecture proposed in the ongoing Italian CNR co-funded ECD Project.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploiting the Web as the multilingual corpus for unknown query translation

Users’ cross-lingual queries to a digital library system might be short and the query terms may not be included in a common translation dictionary (unknown terms). In this paper, we investigate the feasibility of exploiting the Web as the multilingual corpus source to translate unknown query terms for cross-language information retrieval in digital libraries. We propose a Web-based term transla...

متن کامل

Exploiting Extended Service-Oriented Architecture for Federated Digital Libraries

In order to support various requirements from the user’s perspective, digital library (DL) systems may need to apply a large variety of services, such as query services for a specific DL, mapping services for mapping and integrating heterogeneous metadata records, or query modification and expansion services for retrieving additional relevant documents. This paper focuses on exploiting an exten...

متن کامل

Redundant Communication Elimination Optimization in XQuery for Multimedia Contents Management using Metadata in Distributed DL Environments

Recent multimedia digital library where various media have to be managed in distributed environments, metadata play an important role. XQuery, a standard language for querying XML, is expected to integrate multimedia contents through metadata written in XML. In this type of query, performance is significantly affected by external document references over network which are currently implemented ...

متن کامل

Semantics and Pragmatics of Preference Queries in Digital Libraries

As information becomes available in increasing amounts, and to growing numbers of users, the shift towards a more user-centered, or personalized access to information becomes crucial. In this paper we consider the semantics and pragmatics of preference queries over tables containing information objects described through a set of attributes. In particular, we address two basic issues: – how to d...

متن کامل

The Effects of the Relevance-Based Superimposition Model in Cross-Language Information Retrieval

We propose a cross-language information retrieval method that is based on document feature modification and query translation using a dictionary extracted from comparable corpora. In this paper, we show the language-independent effectiveness of our document feature modification model for dealing with semantic ambiguity, and demonstrate the practicality of the proposed method for extracting mult...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004